Rank aggregation with ties: Experiments and Analysis
نویسندگان
چکیده
The problem of aggregating multiple rankings into one consensus ranking is an active research topic especially in the database community. Various studies have implemented methods for rank aggregation and may have come up with contradicting conclusions upon which algorithms work best. Comparing such results is cumbersome, as the original studies mixed different approaches and used very different evaluation datasets and metrics. Additionally, in real applications, the rankings to be aggregated may not be permutations where elements are strictly ordered, but they may have ties where some elements are placed at the same position. However, most of the studies have not considered ties. This paper introduces the first large scale study of algorithms for rank aggregation with ties. More precisely, (i) we review rank aggregation algorithms and determine whether or not they can handle ties; (ii) we propose the first implementation to compute the exact solution of the Rank Aggregation with ties problem; (iii) we evaluate algorithms for rank aggregation with ties on a very large panel of both real and carefully generated synthetic datasets; (iv) we provide guidance on the algorithms to be favored depending on dataset features. ∗LRI (Laboratoire de Recherche en Informatique), CNRS UMR 8623 Univ. Paris-Sud France †State Key Laboratory of Virology, College of Life Sciences, Wuhan Univ., Wuhan, China. ‡Univ. Bordeaux, LaBRI, CNRS UMR 5800, F-33400 Talence, France §Inria, Montpellier France ¶I2BC (Institute for Integrative Biology of the Cell), CEA, CNRS, Univ. Paris-Sud France ]DIRO (Département d’Informatique et de Recherche Opérationnelle) Univ. Montréal Québec Canada ?Corresponding authors This work is licensed under the Creative Commons AttributionNonCommercial-NoDerivs 3.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-nd/3.0/. Obtain permission prior to any use beyond those covered by the license. Contact copyright holder by emailing [email protected]. Articles from this volume were invited to present their results at the 41st International Conference on Very Large Data Bases, August 31st September 4th 2015, Kohala Coast, Hawaii. Proceedings of the VLDB Endowment, Vol. 8, No. 11 Copyright 2015 VLDB Endowment 2150-8097/15/07.
منابع مشابه
An Artificial Fish Swarm Based Supervised Gene Rank Aggregation Algorithm for Informative Genes Studies
As the widespread use of high-throughput genomic and protein analysis, more and more experiments have been done to identify the informative genes for various diseases, thus it provides researchers an opportunity to aggregate across multiple microarray experiments via a rank aggregation approach. However, most of current’s microarray rank aggregation methods are either unweighted or prespecified...
متن کاملStochastic Rank Aggregation
This paper addresses the problem of rank aggregation, which aims to find a consensus ranking among multiple ranking inputs. Traditional rank aggregation methods are deterministic, and can be categorized into explicit and implicit methods depending on whether rank information is explicitly or implicitly utilized. Surprisingly, experimental results on real data sets show that explicit rank aggreg...
متن کاملRasch analysis of rank-ordered data.
Theoretical and practical aspects of several methods for the construction of linear measures from rank-ordered data are presented. The final partial-rankings of 356 professional golfers participating in 47 stroke-play tournaments are used for illustration. The methods include decomposing the rankings into independent paired comparisons without ties, into dependent paired comparisons without tie...
متن کاملProbabilistic Score Normalization for Rank Aggregation
Rank aggregation is a pervading operation in IR technology. We hypothesize that the performance of score-based aggregation may be affected by artificial, usually meaningless deviations consistently occurring in the input score distributions, which distort the combined result when the individual biases differ from each other. We propose a score-based rank aggregation model where the source score...
متن کاملEfficient Rank Join with Aggregation Constraints
We show aggregation constraints that naturally arise in several applications can enrich the semantics of rank join queries, by allowing users to impose their application-specific preferences in a declarative way. By analyzing the properties of aggregation constraints, we develop efficient deterministic and probabilistic algorithms which can push the aggregation constraints inside the rank join ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- PVLDB
دوره 8 شماره
صفحات -
تاریخ انتشار 2015